01:32
2026-07-04
dev.to
artificial-intelligence
David Just Beat Goliath on Terminal-Bench 2.1
Backboard R-CLI, a small open-source terminal agent, achieved the #1 published score on Terminal-Bench 2.1 with 84.3% accuracy (75/89 tasks), beating larger competitors like Codex CLI and Claude Code.โฆ